Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 12227 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Boolean | 1 |
| Categorical | 3 |
release_date has a high cardinality: 3859 distinct values | High cardinality |
id is uniformly distributed | Uniform |
id has unique values | Unique |
instrumentalness has 3602 (29.5%) zeros | Zeros |
key has 1481 (12.1%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-10 11:40:35.016994 |
|---|---|
| Analysis finished | 2021-03-10 11:41:06.630829 |
| Duration | 31.61 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 12227 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8094.03435 |
|---|---|
| Minimum | 1 |
| Maximum | 16227 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 789.3 |
| Q1 | 4026 |
| median | 8093 |
| Q3 | 12180 |
| 95-th percentile | 15409.7 |
| Maximum | 16227 |
| Range | 16226 |
| Interquartile range (IQR) | 8154 |
Descriptive statistics
| Standard deviation | 4690.929822 |
|---|---|
| Coefficient of variation (CV) | 0.57955398 |
| Kurtosis | -1.202594167 |
| Mean | 8094.03435 |
| Median Absolute Deviation (MAD) | 4077 |
| Skewness | 0.001697230668 |
| Sum | 98965758 |
| Variance | 22004822.59 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 5408 | 1 | < 0.1% |
| 3371 | 1 | < 0.1% |
| 1322 | 1 | < 0.1% |
| 7465 | 1 | < 0.1% |
| 5416 | 1 | < 0.1% |
| 11559 | 1 | < 0.1% |
| 15653 | 1 | < 0.1% |
| 13604 | 1 | < 0.1% |
| 3363 | 1 | < 0.1% |
| Other values (12217) | 12217 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 |
| Value | Count | Frequency (%) |
| 16227 | 1 | |
| 16225 | 1 | |
| 16224 | 1 | |
| 16223 | 1 | |
| 16222 | 1 |
acousticness
Real number (ℝ≥0)
| Distinct | 2714 |
|---|---|
| Distinct (%) | 22.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4305783602 |
|---|---|
| Minimum | 1.04 × 106 |
| Maximum | 0.996 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 1.04 × 106 |
|---|---|
| 5-th percentile | 0.00102 |
| Q1 | 0.05895 |
| median | 0.354 |
| Q3 | 0.805 |
| 95-th percentile | 0.989 |
| Maximum | 0.996 |
| Range | 0.99599896 |
| Interquartile range (IQR) | 0.74605 |
Descriptive statistics
| Standard deviation | 0.3668928922 |
|---|---|
| Coefficient of variation (CV) | 0.8520931987 |
| Kurtosis | -1.512941893 |
| Mean | 0.4305783602 |
| Median Absolute Deviation (MAD) | 0.3316 |
| Skewness | 0.2615280658 |
| Sum | 5264.681611 |
| Variance | 0.1346103944 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.995 | 156 | 1.3% |
| 0.994 | 119 | 1.0% |
| 0.993 | 82 | 0.7% |
| 0.991 | 71 | 0.6% |
| 0.992 | 68 | 0.6% |
| 0.99 | 49 | 0.4% |
| 0.989 | 45 | 0.4% |
| 0.986 | 43 | 0.4% |
| 0.996 | 41 | 0.3% |
| 0.984 | 38 | 0.3% |
| Other values (2704) | 11515 |
| Value | Count | Frequency (%) |
| 1.04 × 106 | 1 | |
| 1.08 × 106 | 1 | |
| 1.17 × 106 | 1 | |
| 1.2 × 106 | 1 | |
| 1.34 × 106 | 1 |
| Value | Count | Frequency (%) |
| 0.996 | 41 | 0.3% |
| 0.995 | 156 | |
| 0.994 | 119 | |
| 0.993 | 82 | |
| 0.992 | 68 |
danceability
Real number (ℝ≥0)
| Distinct | 898 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.556352654 |
|---|---|
| Minimum | 0 |
| Maximum | 0.98 |
| Zeros | 13 |
| Zeros (%) | 0.1% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.248 |
| Q1 | 0.438 |
| median | 0.569 |
| Q3 | 0.685 |
| 95-th percentile | 0.827 |
| Maximum | 0.98 |
| Range | 0.98 |
| Interquartile range (IQR) | 0.247 |
Descriptive statistics
| Standard deviation | 0.175372545 |
|---|---|
| Coefficient of variation (CV) | 0.315218313 |
| Kurtosis | -0.3514219069 |
| Mean | 0.556352654 |
| Median Absolute Deviation (MAD) | 0.123 |
| Skewness | -0.2899234856 |
| Sum | 6802.5239 |
| Variance | 0.03075552954 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.632 | 40 | 0.3% |
| 0.611 | 38 | 0.3% |
| 0.665 | 37 | 0.3% |
| 0.501 | 36 | 0.3% |
| 0.621 | 36 | 0.3% |
| 0.623 | 36 | 0.3% |
| 0.49 | 35 | 0.3% |
| 0.606 | 34 | 0.3% |
| 0.576 | 34 | 0.3% |
| 0.628 | 34 | 0.3% |
| Other values (888) | 11867 |
| Value | Count | Frequency (%) |
| 0 | 13 | |
| 0.0608 | 3 | < 0.1% |
| 0.0612 | 1 | < 0.1% |
| 0.0622 | 1 | < 0.1% |
| 0.0625 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.98 | 2 | |
| 0.978 | 1 | |
| 0.974 | 1 | |
| 0.971 | 1 | |
| 0.968 | 1 |
energy
Real number (ℝ≥0)
| Distinct | 1396 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5221287124 |
|---|---|
| Minimum | 2.03 × 105 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 2.03 × 105 |
|---|---|
| 5-th percentile | 0.09496 |
| Q1 | 0.303 |
| median | 0.534 |
| Q3 | 0.739 |
| 95-th percentile | 0.93 |
| Maximum | 1 |
| Range | 0.9999797 |
| Interquartile range (IQR) | 0.436 |
Descriptive statistics
| Standard deviation | 0.2624822911 |
|---|---|
| Coefficient of variation (CV) | 0.5027156807 |
| Kurtosis | -1.055906763 |
| Mean | 0.5221287124 |
| Median Absolute Deviation (MAD) | 0.217 |
| Skewness | -0.09056870595 |
| Sum | 6384.067767 |
| Variance | 0.06889695314 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.412 | 26 | 0.2% |
| 0.538 | 26 | 0.2% |
| 0.512 | 24 | 0.2% |
| 0.701 | 24 | 0.2% |
| 0.614 | 24 | 0.2% |
| 0.574 | 24 | 0.2% |
| 0.593 | 23 | 0.2% |
| 0.431 | 23 | 0.2% |
| 0.537 | 23 | 0.2% |
| 0.553 | 23 | 0.2% |
| Other values (1386) | 11987 |
| Value | Count | Frequency (%) |
| 2.03 × 105 | 1 | |
| 7.46 × 105 | 1 | |
| 0.000261 | 1 | |
| 0.000281 | 1 | |
| 0.00121 | 1 |
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 0.999 | 2 | < 0.1% |
| 0.998 | 2 | < 0.1% |
| 0.997 | 7 | |
| 0.996 | 7 |
explicit
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 10906 | |
| True | 1321 | 10.8% |
| Distinct | 3658 |
|---|---|
| Distinct (%) | 29.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1493205587 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 3602 |
| Zeros (%) | 29.5% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.000115 |
| Q3 | 0.05565 |
| 95-th percentile | 0.895 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.05565 |
Descriptive statistics
| Standard deviation | 0.2979543138 |
|---|---|
| Coefficient of variation (CV) | 1.995400476 |
| Kurtosis | 1.579457133 |
| Mean | 0.1493205587 |
| Median Absolute Deviation (MAD) | 0.000115 |
| Skewness | 1.804604491 |
| Sum | 1825.742471 |
| Variance | 0.0887767731 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3602 | |
| 0.918 | 17 | 0.1% |
| 0.862 | 16 | 0.1% |
| 0.892 | 16 | 0.1% |
| 0.908 | 16 | 0.1% |
| 0.896 | 15 | 0.1% |
| 0.929 | 15 | 0.1% |
| 0.898 | 14 | 0.1% |
| 0.891 | 14 | 0.1% |
| 0.893 | 14 | 0.1% |
| Other values (3648) | 8488 |
| Value | Count | Frequency (%) |
| 0 | 3602 | |
| 1 × 106 | 3 | < 0.1% |
| 1.01 × 106 | 6 | < 0.1% |
| 1.02 × 106 | 6 | < 0.1% |
| 1.03 × 106 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 0.998 | 2 | |
| 0.997 | 1 | < 0.1% |
| 0.995 | 1 | < 0.1% |
| 0.991 | 1 | < 0.1% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.205201603 |
|---|---|
| Minimum | 0 |
| Maximum | 11 |
| Zeros | 1481 |
| Zeros (%) | 12.1% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.526953879 |
|---|---|
| Coefficient of variation (CV) | 0.6775825698 |
| Kurtosis | -1.274939405 |
| Mean | 5.205201603 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.01892504914 |
| Sum | 63644 |
| Variance | 12.43940366 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1481 | |
| 7 | 1464 | |
| 2 | 1336 | |
| 9 | 1262 | |
| 5 | 1170 | |
| 1 | 1037 | |
| 4 | 923 | |
| 10 | 842 | |
| 11 | 825 | |
| 8 | 732 | |
| Other values (2) | 1155 |
| Value | Count | Frequency (%) |
| 0 | 1481 | |
| 1 | 1037 | |
| 2 | 1336 | |
| 3 | 498 | 4.1% |
| 4 | 923 |
| Value | Count | Frequency (%) |
| 11 | 825 | |
| 10 | 842 | |
| 9 | 1262 | |
| 8 | 732 | |
| 7 | 1464 |
liveness
Real number (ℝ≥0)
| Distinct | 1477 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.201364562 |
|---|---|
| Minimum | 0.0147 |
| Maximum | 0.997 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0.0147 |
|---|---|
| 5-th percentile | 0.0578 |
| Q1 | 0.0962 |
| median | 0.132 |
| Q3 | 0.252 |
| 95-th percentile | 0.5977 |
| Maximum | 0.997 |
| Range | 0.9823 |
| Interquartile range (IQR) | 0.1558 |
Descriptive statistics
| Standard deviation | 0.1739874923 |
|---|---|
| Coefficient of variation (CV) | 0.8640422651 |
| Kurtosis | 5.314608221 |
| Mean | 0.201364562 |
| Median Absolute Deviation (MAD) | 0.051 |
| Skewness | 2.212065103 |
| Sum | 2462.0845 |
| Variance | 0.03027164747 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.111 | 132 | 1.1% |
| 0.109 | 112 | 0.9% |
| 0.11 | 112 | 0.9% |
| 0.103 | 111 | 0.9% |
| 0.106 | 110 | 0.9% |
| 0.105 | 110 | 0.9% |
| 0.113 | 109 | 0.9% |
| 0.102 | 106 | 0.9% |
| 0.107 | 102 | 0.8% |
| 0.112 | 101 | 0.8% |
| Other values (1467) | 11122 |
| Value | Count | Frequency (%) |
| 0.0147 | 1 | |
| 0.015 | 1 | |
| 0.0162 | 1 | |
| 0.0166 | 1 | |
| 0.0185 | 1 |
| Value | Count | Frequency (%) |
| 0.997 | 1 | |
| 0.994 | 1 | |
| 0.993 | 1 | |
| 0.992 | 2 | |
| 0.991 | 1 |
loudness
Real number (ℝ)
| Distinct | 8718 |
|---|---|
| Distinct (%) | 71.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -10.66868651 |
|---|---|
| Minimum | -43.738 |
| Maximum | 1.006 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | -43.738 |
|---|---|
| 5-th percentile | -21.118 |
| Q1 | -13.656 |
| median | -9.584 |
| Q3 | -6.5715 |
| 95-th percentile | -3.9239 |
| Maximum | 1.006 |
| Range | 44.744 |
| Interquartile range (IQR) | 7.0845 |
Descriptive statistics
| Standard deviation | 5.506888135 |
|---|---|
| Coefficient of variation (CV) | -0.5161730198 |
| Kurtosis | 2.16120605 |
| Mean | -10.66868651 |
| Median Absolute Deviation (MAD) | 3.4 |
| Skewness | -1.199524572 |
| Sum | -130446.03 |
| Variance | 30.32581693 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -6.901 | 6 | < 0.1% |
| -7.509 | 6 | < 0.1% |
| -7.355 | 6 | < 0.1% |
| -10.605 | 6 | < 0.1% |
| -6.784 | 6 | < 0.1% |
| -4.255 | 5 | < 0.1% |
| -10.21 | 5 | < 0.1% |
| -10.199 | 5 | < 0.1% |
| -10.322 | 5 | < 0.1% |
| -6.086 | 5 | < 0.1% |
| Other values (8708) | 12172 |
| Value | Count | Frequency (%) |
| -43.738 | 1 | |
| -43.469 | 1 | |
| -42.001 | 1 | |
| -41.786 | 1 | |
| -41.594 | 1 |
| Value | Count | Frequency (%) |
| 1.006 | 1 | |
| -0.029 | 1 | |
| -0.574 | 1 | |
| -0.795 | 1 | |
| -0.923 | 2 |
mode
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.6 KiB |
| Major | |
|---|---|
| Minor |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 61135 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Major |
|---|---|
| 2nd row | Major |
| 3rd row | Minor |
| 4th row | Major |
| 5th row | Major |
| Value | Count | Frequency (%) |
| Major | 8487 | |
| Minor | 3740 |
| Value | Count | Frequency (%) |
| major | 8487 | |
| minor | 3740 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 12227 | |
| o | 12227 | |
| r | 12227 | |
| a | 8487 | |
| j | 8487 | |
| i | 3740 | 6.1% |
| n | 3740 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48908 | |
| Uppercase Letter | 12227 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 12227 | |
| r | 12227 | |
| a | 8487 | |
| j | 8487 | |
| i | 3740 | 7.6% |
| n | 3740 | 7.6% |
| Value | Count | Frequency (%) |
| M | 12227 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61135 |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 12227 | |
| o | 12227 | |
| r | 12227 | |
| a | 8487 | |
| j | 8487 | |
| i | 3740 | 6.1% |
| n | 3740 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61135 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 12227 | |
| o | 12227 | |
| r | 12227 | |
| a | 8487 | |
| j | 8487 | |
| i | 3740 | 6.1% |
| n | 3740 | 6.1% |
| Distinct | 3859 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.6 KiB |
| 01-01-1961 | 90 |
|---|---|
| 01-01-1962 | 88 |
| 01-01-1992 | 85 |
| 01-01-1998 | 84 |
| 01-01-1990 | 82 |
| Other values (3854) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 122270 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2256 ? |
|---|---|
| Unique (%) | 18.5% |
Sample
| 1st row | 01-01-1947 |
|---|---|
| 2nd row | 13-11-2020 |
| 3rd row | 01-01-1950 |
| 4th row | 30-04-1974 |
| 5th row | 01-01-1973 |
| Value | Count | Frequency (%) |
| 01-01-1961 | 90 | 0.7% |
| 01-01-1962 | 88 | 0.7% |
| 01-01-1992 | 85 | 0.7% |
| 01-01-1998 | 84 | 0.7% |
| 01-01-1990 | 82 | 0.7% |
| 01-01-1945 | 82 | 0.7% |
| 01-01-1940 | 80 | 0.7% |
| 01-01-1958 | 79 | 0.6% |
| 01-01-1987 | 78 | 0.6% |
| 01-01-1956 | 78 | 0.6% |
| Other values (3849) | 11401 |
| Value | Count | Frequency (%) |
| 01-01-1961 | 90 | 0.7% |
| 01-01-1962 | 88 | 0.7% |
| 01-01-1992 | 85 | 0.7% |
| 01-01-1998 | 84 | 0.7% |
| 01-01-1990 | 82 | 0.7% |
| 01-01-1945 | 82 | 0.7% |
| 01-01-1940 | 80 | 0.7% |
| 01-01-1958 | 79 | 0.6% |
| 01-01-1987 | 78 | 0.6% |
| 01-01-1956 | 78 | 0.6% |
| Other values (3849) | 11401 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 29094 | |
| 0 | 25638 | |
| - | 24454 | |
| 9 | 12345 | |
| 2 | 10056 | 8.2% |
| 8 | 3914 | 3.2% |
| 6 | 3739 | 3.1% |
| 7 | 3702 | 3.0% |
| 5 | 3343 | 2.7% |
| 3 | 3201 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 97816 | |
| Dash Punctuation | 24454 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 29094 | |
| 0 | 25638 | |
| 9 | 12345 | |
| 2 | 10056 | 10.3% |
| 8 | 3914 | 4.0% |
| 6 | 3739 | 3.8% |
| 7 | 3702 | 3.8% |
| 5 | 3343 | 3.4% |
| 3 | 3201 | 3.3% |
| 4 | 2784 | 2.8% |
| Value | Count | Frequency (%) |
| - | 24454 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 122270 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 29094 | |
| 0 | 25638 | |
| - | 24454 | |
| 9 | 12345 | |
| 2 | 10056 | 8.2% |
| 8 | 3914 | 3.2% |
| 6 | 3739 | 3.1% |
| 7 | 3702 | 3.0% |
| 5 | 3343 | 2.7% |
| 3 | 3201 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 122270 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 29094 | |
| 0 | 25638 | |
| - | 24454 | |
| 9 | 12345 | |
| 2 | 10056 | 8.2% |
| 8 | 3914 | 3.2% |
| 6 | 3739 | 3.1% |
| 7 | 3702 | 3.0% |
| 5 | 3343 | 2.7% |
| 3 | 3201 | 2.6% |
speechiness
Real number (ℝ≥0)
| Distinct | 1275 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09767980698 |
|---|---|
| Minimum | 0 |
| Maximum | 0.968 |
| Zeros | 13 |
| Zeros (%) | 0.1% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0279 |
| Q1 | 0.0347 |
| median | 0.0456 |
| Q3 | 0.0789 |
| 95-th percentile | 0.349 |
| Maximum | 0.968 |
| Range | 0.968 |
| Interquartile range (IQR) | 0.0442 |
Descriptive statistics
| Standard deviation | 0.155894608 |
|---|---|
| Coefficient of variation (CV) | 1.595975799 |
| Kurtosis | 18.11328828 |
| Mean | 0.09767980698 |
| Median Absolute Deviation (MAD) | 0.0139 |
| Skewness | 4.10021255 |
| Sum | 1194.331 |
| Variance | 0.02430312881 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.0321 | 51 | 0.4% |
| 0.0319 | 49 | 0.4% |
| 0.0343 | 48 | 0.4% |
| 0.0363 | 47 | 0.4% |
| 0.0287 | 47 | 0.4% |
| 0.0305 | 46 | 0.4% |
| 0.0324 | 46 | 0.4% |
| 0.0337 | 46 | 0.4% |
| 0.0339 | 46 | 0.4% |
| 0.0333 | 45 | 0.4% |
| Other values (1265) | 11756 |
| Value | Count | Frequency (%) |
| 0 | 13 | |
| 0.0223 | 1 | < 0.1% |
| 0.0226 | 1 | < 0.1% |
| 0.0228 | 2 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.968 | 1 | < 0.1% |
| 0.967 | 2 | < 0.1% |
| 0.965 | 6 | |
| 0.964 | 5 | |
| 0.963 | 11 |
tempo
Real number (ℝ≥0)
| Distinct | 11264 |
|---|---|
| Distinct (%) | 92.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118.1674949 |
|---|---|
| Minimum | 0 |
| Maximum | 216.843 |
| Zeros | 13 |
| Zeros (%) | 0.1% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 75.3244 |
| Q1 | 95.0505 |
| median | 116.915 |
| Q3 | 136.1085 |
| 95-th percentile | 174.5798 |
| Maximum | 216.843 |
| Range | 216.843 |
| Interquartile range (IQR) | 41.058 |
Descriptive statistics
| Standard deviation | 30.20006382 |
|---|---|
| Coefficient of variation (CV) | 0.2555699759 |
| Kurtosis | -0.01355677422 |
| Mean | 118.1674949 |
| Median Absolute Deviation (MAD) | 20.875 |
| Skewness | 0.4147768412 |
| Sum | 1444833.96 |
| Variance | 912.0438545 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 13 | 0.1% |
| 128.01 | 8 | 0.1% |
| 125.005 | 5 | < 0.1% |
| 120.005 | 5 | < 0.1% |
| 106.991 | 4 | < 0.1% |
| 130 | 4 | < 0.1% |
| 128.007 | 4 | < 0.1% |
| 123.997 | 4 | < 0.1% |
| 130.029 | 4 | < 0.1% |
| 119.993 | 4 | < 0.1% |
| Other values (11254) | 12172 |
| Value | Count | Frequency (%) |
| 0 | 13 | |
| 39.875 | 2 | < 0.1% |
| 42.49 | 1 | < 0.1% |
| 44.9 | 1 | < 0.1% |
| 46.329 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 216.843 | 1 | |
| 216.096 | 1 | |
| 216.083 | 1 | |
| 215.023 | 1 | |
| 212.242 | 1 |
valence
Real number (ℝ≥0)
| Distinct | 1256 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5253000728 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 17 |
| Zeros (%) | 0.1% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.09893 |
| Q1 | 0.321 |
| median | 0.532 |
| Q3 | 0.737 |
| 95-th percentile | 0.933 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.416 |
Descriptive statistics
| Standard deviation | 0.2582046985 |
|---|---|
| Coefficient of variation (CV) | 0.4915375265 |
| Kurtosis | -1.023706984 |
| Mean | 0.5253000728 |
| Median Absolute Deviation (MAD) | 0.208 |
| Skewness | -0.08203179183 |
| Sum | 6422.84399 |
| Variance | 0.06666966631 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.961 | 41 | 0.3% |
| 0.964 | 34 | 0.3% |
| 0.962 | 34 | 0.3% |
| 0.967 | 33 | 0.3% |
| 0.965 | 32 | 0.3% |
| 0.966 | 28 | 0.2% |
| 0.54 | 26 | 0.2% |
| 0.963 | 25 | 0.2% |
| 0.357 | 24 | 0.2% |
| 0.96 | 24 | 0.2% |
| Other values (1246) | 11926 |
| Value | Count | Frequency (%) |
| 0 | 17 | |
| 1 × 105 | 7 | |
| 0.00554 | 1 | < 0.1% |
| 0.00558 | 1 | < 0.1% |
| 0.0154 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.993 | 1 | |
| 0.99 | 1 | |
| 0.989 | 1 | |
| 0.988 | 1 |
year
Real number (ℝ≥0)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1984.517298 |
|---|---|
| Minimum | 1920 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 1920 |
|---|---|
| 5-th percentile | 1937 |
| Q1 | 1966 |
| median | 1987 |
| Q3 | 2008 |
| 95-th percentile | 2019 |
| Maximum | 2021 |
| Range | 101 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 25.91199777 |
|---|---|
| Coefficient of variation (CV) | 0.01305707831 |
| Kurtosis | -0.8089325033 |
| Mean | 1984.517298 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | -0.4107160383 |
| Sum | 24264693 |
| Variance | 671.4316286 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2020 | 466 | 3.8% |
| 2018 | 311 | 2.5% |
| 2019 | 288 | 2.4% |
| 2017 | 265 | 2.2% |
| 2016 | 258 | 2.1% |
| 2015 | 211 | 1.7% |
| 2013 | 199 | 1.6% |
| 2002 | 197 | 1.6% |
| 1999 | 187 | 1.5% |
| 1998 | 185 | 1.5% |
| Other values (92) | 9660 |
| Value | Count | Frequency (%) |
| 1920 | 13 | |
| 1921 | 4 | < 0.1% |
| 1922 | 5 | < 0.1% |
| 1923 | 5 | < 0.1% |
| 1924 | 9 |
| Value | Count | Frequency (%) |
| 2021 | 111 | 0.9% |
| 2020 | 466 | |
| 2019 | 288 | |
| 2018 | 311 | |
| 2017 | 265 |
duration-min
Real number (ℝ≥0)
| Distinct | 172 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.888132821 |
|---|---|
| Minimum | 0.2 |
| Maximum | 72.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 95.6 KiB |
Quantile statistics
| Minimum | 0.2 |
|---|---|
| 5-th percentile | 1.9 |
| Q1 | 2.9 |
| median | 3.6 |
| Q3 | 4.4 |
| 95-th percentile | 6.7 |
| Maximum | 72.8 |
| Range | 72.6 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 2.383133109 |
|---|---|
| Coefficient of variation (CV) | 0.6129248199 |
| Kurtosis | 283.0939223 |
| Mean | 3.888132821 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 12.46312956 |
| Sum | 47540.2 |
| Variance | 5.679323415 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.1 | 489 | 4.0% |
| 3.2 | 487 | 4.0% |
| 3.4 | 483 | 4.0% |
| 3 | 458 | 3.7% |
| 3.3 | 457 | 3.7% |
| 3.6 | 457 | 3.7% |
| 3.5 | 449 | 3.7% |
| 3.7 | 434 | 3.5% |
| 2.9 | 419 | 3.4% |
| 3.9 | 404 | 3.3% |
| Other values (162) | 7690 |
| Value | Count | Frequency (%) |
| 0.2 | 3 | < 0.1% |
| 0.3 | 6 | < 0.1% |
| 0.4 | 5 | < 0.1% |
| 0.5 | 11 | |
| 0.6 | 23 |
| Value | Count | Frequency (%) |
| 72.8 | 2 | |
| 66.9 | 1 | |
| 62.2 | 1 | |
| 60.3 | 1 | |
| 59.3 | 1 |
popularity
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.6 KiB |
| very low | |
|---|---|
| low | |
| average | |
| high | |
| very high |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.664431177 |
| Min length | 3 |
Characters and Unicode
| Total characters | 69259 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | very low |
|---|---|
| 2nd row | low |
| 3rd row | very low |
| 4th row | low |
| 5th row | average |
| Value | Count | Frequency (%) |
| very low | 3222 | |
| low | 3118 | |
| average | 2912 | |
| high | 2606 | |
| very high | 369 | 3.0% |
| Value | Count | Frequency (%) |
| low | 6340 | |
| very | 3591 | |
| high | 2975 | |
| average | 2912 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9415 | |
| v | 6503 | |
| r | 6503 | |
| l | 6340 | |
| o | 6340 | |
| w | 6340 | |
| h | 5950 | |
| g | 5887 | |
| a | 5824 | |
| y | 3591 | 5.2% |
| Other values (2) | 6566 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65668 | |
| Space Separator | 3591 | 5.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 9415 | |
| v | 6503 | |
| r | 6503 | |
| l | 6340 | |
| o | 6340 | |
| w | 6340 | |
| h | 5950 | |
| g | 5887 | |
| a | 5824 | |
| y | 3591 | 5.5% |
| Value | Count | Frequency (%) |
| 3591 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65668 | |
| Common | 3591 | 5.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 9415 | |
| v | 6503 | |
| r | 6503 | |
| l | 6340 | |
| o | 6340 | |
| w | 6340 | |
| h | 5950 | |
| g | 5887 | |
| a | 5824 | |
| y | 3591 | 5.5% |
| Value | Count | Frequency (%) |
| 3591 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69259 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 9415 | |
| v | 6503 | |
| r | 6503 | |
| l | 6340 | |
| o | 6340 | |
| w | 6340 | |
| h | 5950 | |
| g | 5887 | |
| a | 5824 | |
| y | 3591 | 5.2% |
| Other values (2) | 6566 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| id | acousticness | danceability | energy | explicit | instrumentalness | key | liveness | loudness | mode | release_date | speechiness | tempo | valence | year | duration-min | popularity | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015 | 0.949 | 0.235 | 0.0276 | No | 0.92700 | 5 | 0.5130 | -27.398 | Major | 01-01-1947 | 0.0381 | 110.838 | 0.0398 | 1947 | 3.0 | very low |
| 1 | 15901 | 0.855 | 0.456 | 0.4850 | No | 0.08840 | 4 | 0.1510 | -10.046 | Major | 13-11-2020 | 0.0437 | 152.066 | 0.8590 | 2020 | 2.4 | low |
| 2 | 9002 | 0.827 | 0.495 | 0.4990 | No | 0.00000 | 0 | 0.4010 | -8.009 | Minor | 01-01-1950 | 0.0474 | 108.004 | 0.7090 | 1950 | 2.6 | very low |
| 3 | 6734 | 0.654 | 0.643 | 0.4690 | No | 0.10800 | 7 | 0.2180 | -15.917 | Major | 30-04-1974 | 0.0368 | 83.636 | 0.9640 | 1974 | 2.4 | low |
| 4 | 15563 | 0.738 | 0.705 | 0.3110 | No | 0.00000 | 5 | 0.3220 | -12.344 | Major | 01-01-1973 | 0.0488 | 117.260 | 0.7850 | 1973 | 3.4 | average |
| 5 | 14384 | 0.898 | 0.498 | 0.4420 | No | 0.00319 | 10 | 0.0974 | -9.481 | Major | 01-01-1968 | 0.0337 | 109.619 | 0.3550 | 1968 | 2.6 | low |
| 6 | 954 | 0.259 | 0.620 | 0.7580 | No | 0.00132 | 5 | 0.4160 | -8.183 | Major | 13-11-1942 | 0.0343 | 119.258 | 0.9120 | 1942 | 2.4 | very low |
| 7 | 5930 | 0.124 | 0.879 | 0.6280 | Yes | 0.00000 | 1 | 0.0661 | -6.668 | Minor | 01-01-2005 | 0.2640 | 150.105 | 0.7210 | 2005 | 3.5 | average |
| 8 | 11900 | 0.149 | 0.697 | 0.1840 | Yes | 0.00000 | 2 | 0.0763 | -23.303 | Minor | 01-01-1945 | 0.9330 | 133.997 | 0.6130 | 1945 | 1.6 | very low |
| 9 | 14498 | 0.470 | 0.587 | 0.5660 | No | 0.00000 | 9 | 0.0644 | -9.932 | Major | 01-01-1999 | 0.0276 | 76.054 | 0.5290 | 1999 | 7.7 | high |
Last rows
| id | acousticness | danceability | energy | explicit | instrumentalness | key | liveness | loudness | mode | release_date | speechiness | tempo | valence | year | duration-min | popularity | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 12217 | 2521 | 0.9520 | 0.1110 | 0.139 | No | 0.847000 | 7 | 0.173 | -25.052 | Minor | 01-01-1995 | 0.0465 | 176.447 | 0.08350 | 1995 | 1.2 | average |
| 12218 | 8280 | 0.9450 | 0.4920 | 0.122 | No | 0.868000 | 11 | 0.108 | -19.844 | Minor | 26-11-1971 | 0.0702 | 131.646 | 0.59900 | 1971 | 3.0 | low |
| 12219 | 13205 | 0.1370 | 0.4080 | 0.922 | No | 0.447000 | 7 | 0.983 | -9.745 | Major | 14-11-1979 | 0.0526 | 110.407 | 0.34200 | 1979 | 3.4 | low |
| 12220 | 10864 | 0.2630 | 0.6530 | 0.609 | No | 0.001010 | 11 | 0.233 | -7.519 | Minor | 26-06-2001 | 0.0370 | 95.982 | 0.48200 | 2001 | 3.5 | high |
| 12221 | 15234 | 0.9090 | 0.4350 | 0.433 | No | 0.963000 | 2 | 0.118 | -20.343 | Minor | 25-01-2011 | 0.0348 | 179.923 | 0.26000 | 2011 | 1.6 | average |
| 12222 | 15343 | 0.0408 | 0.8090 | 0.801 | No | 0.000000 | 1 | 0.353 | -5.461 | Major | 01-07-2014 | 0.4070 | 81.940 | 0.74400 | 2014 | 3.4 | average |
| 12223 | 1701 | 0.9120 | 0.4510 | 0.240 | No | 0.000002 | 1 | 0.175 | -14.014 | Major | 01-01-1959 | 0.0351 | 134.009 | 0.70100 | 1959 | 2.0 | very high |
| 12224 | 3351 | 0.3280 | 0.5510 | 0.564 | No | 0.002950 | 2 | 0.352 | -9.298 | Minor | 01-01-1984 | 0.0338 | 124.883 | 0.89000 | 1984 | 2.5 | low |
| 12225 | 8879 | 0.1220 | 0.0608 | 0.939 | No | 0.991000 | 1 | 0.912 | -26.324 | Major | 09-01-2017 | 0.1180 | 73.234 | 0.00558 | 2017 | 3.1 | high |
| 12226 | 9711 | 0.0380 | 0.3890 | 0.768 | Yes | 0.000000 | 1 | 0.119 | -4.765 | Major | 24-07-2020 | 0.2560 | 90.146 | 0.33400 | 2020 | 3.1 | high |